Trending Articles

article thumbnail

Is the “AI developer”a threat to jobs – or a marketing stunt?

The Pragmatic Engineer

This article was published on 14 March 2024 in The Pragmatic Engineer, for subscribers. I'm sharing this piece in public more than a month later, as it provides important context and analysis for the AI dev tools space. Subscribe to The Pragmatic Engineer to stay up-to-date on what is happening with software engineering, Big Tech, and startups.

article thumbnail

How to build a data team

Christophe Blefari

My personal collection of the best resources to bootstrap a data team and get inspired from what others are doing.

Building 130
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Why did Golang lose to Rust for Data Engineering?

Confessions of a Data Guy

A few years ago I wasn’t sure, who was going to win, Golang seemed to be popular, and still is for that matter. When I first wrote a little Golang (~2+ years ago) I was just trying to see what the hype was all about. The funny thing is, at the time, and today, it […] The post Why did Golang lose to Rust for Data Engineering? appeared first on Confessions of a Data Guy.

article thumbnail

Terms You Should Know If You’re Planning To Use Change Data Capture

Seattle Data Guy

If you’ve worked in data long enough, then you’ve likely come across the term change data capture. Often called CDC, change data capture involves tracking and recording changes in a database as they happen, and then transmitting these changes to designated targets. This can be crucial because some pipelines, in particular batch pipelines, don’t capture… Read more The post Terms You Should Know If You’re Planning To Use Change Data Capture appeared first on Seattle D

Database 130
article thumbnail

How To Get Promoted In Product Management

Speaker: John Mansour

If you're looking to advance your career in product management, there are more options than just climbing the management ladder. Join our upcoming webinar to learn about highly rewarding career paths that don't involve management responsibilities. We'll cover both career tracks and provide tips on how to position yourself for success in the one that's right for you.

article thumbnail

Build Your Second Brain One Piece At A Time

Data Engineering Podcast

Summary Generative AI promises to accelerate the productivity of human collaborators. Currently the primary way of working with these tools is through a conversational prompt, which is often cumbersome and unwieldy. In order to simplify the integration of AI capabilities into developer workflows Tsavo Knott helped create Pieces, a powerful collection of tools that complements the tools that developers already use.

Building 130
article thumbnail

Introducing Confluent Cloud Freight Clusters

Confluent

Confluent Cloud Freight clusters are now available in Early Access. In this blog, learn how Freight clusters can save you up to 90% at GBps+ scale.

Cloud 145

More Trending

article thumbnail

Agile Coach vs Scrum Master: The Difference Stated

Knowledge Hut

Agile methodology is a simple, flexible, and iterative product development model with the distinct advantages of accommodating new requirement changes and incorporating the feedback of the previous iterations over the traditional waterfall development model. Agile methodology is the most popular and dynamic software product development and project maintenance model.

article thumbnail

A Starter Guide to Data Structures for AI and Machine Learning

KDnuggets

This article is an overview of a particular subset of data structures useful in machine learning and AI development, along with explanations and example implementations.

article thumbnail

A Notebook is all I want or Don't

Data Engineering Weekly

The tweet received strong reactions on LinkedIn and Twitter. To clarify, I quoted it as a Notebook-style development, but it is not exactly a Notebook. There is a lot of context missing in that tweet, so I decided to write a blog about it. People have reservations about using tools like Jupytor Notebook for the production pipeline for a good reason.

article thumbnail

Meet the 2024 Snowflake Startup Challenge Finalists

Snowflake

The 2024 Snowflake Startup Challenge began with over 900 applications from startups Powered by Snowflake in more than 100 countries. Our judges narrowed that long list of contenders down to 10, and after much deliberation, they’ve now pared it down to the final three. We are pleased to announce that BigGeo, Scientific Financial Systems and SignalFlare.ai by Extropy360 will advance to the Snowflake Startup Challenge finale and compete for the opportunity to receive a share of up to $1 million in

Media 101
article thumbnail

Navigating the Future: Generative AI, Application Analytics, and Data

Generative AI is upending the way product developers & end-users alike are interacting with data. Despite the potential of AI, many are left with questions about the future of product development: How will AI impact my business and contribute to its success? What can product managers and developers expect in the future with the widespread adoption of AI?

article thumbnail

If:Else Logic and Complexity – Hiding the Pea.

Confessions of a Data Guy

I was recently confronted with an interesting conundrum when writing a complex data pipeline. It was an interesting problem that arose from my quest to reduce complexity in part of the design, which found itself creeping into another part, re-enforcing the classic idea of whether you can really make the complexity pea go away, or […] The post If:Else Logic and Complexity – Hiding the Pea. appeared first on Confessions of a Data Guy.

article thumbnail

How to Stand Out in a Python Coding Interview - Functions, Data Structures & Libraries

Knowledge Hut

Any coding interview is a test that primarily focuses on your technical skills and algorithm knowledge. However, if you want to stand out among the hundreds of interviewees, you should know how to use the common functionalities of Python in a convenient manner. Before moving ahead, read about Self in Python and what is markdown ! The type of interview you might face can be a remote coding challenge, a whiteboard challenge or a full day on-site interview.

Python 98
article thumbnail

The right words in the right place

Tweag

tl;dr You may not believe it, but Nix documentation is getting better. Nixpkgs and NixOS still need more time. Table of contents Overview Motivation Statistics Retrospective Thoughts on future work Acknowledgements Overview This is a retrospective of my and many other people’s work on documentation in the Nix ecosystem between October 2022 and March 2024.

article thumbnail

Calibrating the Mosaic Evaluation Gauntlet

databricks

A good benchmark is one that clearly shows which models are better and which are worse. The Databricks Mosaic Research team is dedicated.

115
115
article thumbnail

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enahance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

article thumbnail

Avoid These 5 Common Mistakes Every Novice in AI Makes

KDnuggets

Top five mistakes made by AI beginners and practical tips to avoid them, along with an engaging "50-Day Challenge" that you cannot afford to miss.

100
100
article thumbnail

Reaction to Data Engineering Survey for 2024

Confessions of a Data Guy

The post Reaction to Data Engineering Survey for 2024 appeared first on Confessions of a Data Guy.

article thumbnail

Decision Tree Algorithm in Machine Learning: Types, Examples

Knowledge Hut

Machine Learning is an interdisciplinary field of study and is a sub-domain of Artificial Intelligence. It gives computers the ability to learn and infer from a huge amount of homogeneous data, without having to be programmed explicitly. Before dwelling on this article, let's know more about r squared meaning here. Types of Machine Learning: Machine Learning can broadly be classified into three types: Supervised Learning: If the available dataset has predefined features and labels, on which

article thumbnail

Moving Beyond MTEB and BEIR: Snowflake AI Research Joins Forces with the University of Waterloo to Evolve RAG and Retrieval Benchmarks

Snowflake

To accurately answer business questions using LLMs, companies must augment models with their data. Retrieval Augmented Generation (RAG) is a popular solution to this problem, as it integrates the organization’s factual, real-time data into the prompt for the LLM. While the adoption of RAG has increased, an open question remains: How do enterprises know how effective their system is?

Cloud 89
article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Databricks named a Leader in the 2024 Forrester Wave for Data Lakehouses

databricks

We are proud to announce that Forrester has recognized Databricks as a Leader with the highest scores in both current offering and strategy.

Data 108
article thumbnail

Data Science Degrees vs. Courses: The Value Verdict

KDnuggets

Exploring the merits of data science degrees vs courses, this analysis contrasts their depth, prestige, and practicality in job market preparation

article thumbnail

Google Fires Python. What Next?

Confessions of a Data Guy

What is going on? Is the world coming to an end? I thought Python was going to live forever. Well, apparently not at Google. Recently Google announced it was laying off its entire North American-based Python team that was supporting Google’s special needs with Python, in favor of cheaper offshore workers. Apparently, some of these […] The post Google Fires Python.

Python 100
article thumbnail

Why Working Remotely is an Issue with IT Managers?

Knowledge Hut

The work scenario today is stretching workplace flexibilities to accommodate the needs of professionals. Globally stationed offices have also made extending flexible workplaces a norm. Working remotely is the new trend that is transcending industries. While working remotely comes with its own set of benefits, it isn’t well-suited for some industries or professions.

IT 98
article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

BazelDay Amsterdam 2024 at Booking.com

Booking.com Engineering

On 25 March, Booking.com joined forces with Engflow Inc to organize and host Amsterdam’s first ever first Bazel Community Day. Booking.com’s brand new headquarters welcomed over 70 attendees from different companies and backgrounds, for an afternoon of connection, talks, and an unconference in which 3 topics were discussed. A productive, insightful afternoon, all capped off with a campus tour and drinks overlooking the Amsterdam skyline from Oosterdok Island. 4 keynote speakers took to the stage

Cloud 90
article thumbnail

Intelligently Balance Cost Optimization & Reliability on Databricks

databricks

The Databricks Data Intelligence Platform offers unparalleled flexibility, allowing users to access nearly instant, horizontally scalable compute resources. This ease of creation can.

article thumbnail

5 MLOps Courses from Google to Level Up Your ML Workflow

KDnuggets

Want to build and deploy robust machine learning systems to production? Start learning MLOps today with these courses from Google.

article thumbnail

Top 8 Snowflake Marketplace Questions, Answered

Snowflake

Snowflake Marketplace is designed to give customers and organizations a place to easily find, try and buy data, apps and AI products that help solve their most pressing business problems. We have more than 540 providers, offering over 2,400 live, ready-to-use data products (as of Jan 31, 2024), so there are many options to help you enrich your own data resources, build new data apps and leverage the power of AI on Snowflake.

article thumbnail

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

article thumbnail

Apache Kafka Vs Apache Spark: Know the Differences

Knowledge Hut

A new breed of ‘Fast Data’ architectures has evolved to be stream-oriented, where data is processed as it arrives, providing businesses with a competitive advantage. - Dean Wampler (Renowned author of many big data technology-related books) Dean Wampler makes an important point in one of his webinars. The demand for stream processing is increasing every day in today’s era.

Kafka 96
article thumbnail

DragonCrawl: Generative AI for High-Quality Mobile Testing

Uber Engineering

Learn how Uber improved mobile testing reliability, and increased productivity for thousands of engineers, using machine learning to create DragonCrawl, a highly stable and low-maintenance testing system.

article thumbnail

Executive Overview: The Rise of Open Foundational Models

databricks

Moving generative AI applications from the proof of concept stage into production requires control, reliability and data governance. Organizations are turning to open.

article thumbnail

Containerize Python Apps with Docker in 5 Easy Steps

KDnuggets

Get up and running with Docker with this tutorial on containerizing Python applications.

Python 107
article thumbnail

How Embedded Analytics Gets You to Market Faster with a SAAS Offering

Start-ups & SMBs launching products quickly must bundle dashboards, reports, & self-service analytics into apps. Customers expect rapid value from your product (time-to-value), data security, and access to advanced capabilities. Traditional Business Intelligence (BI) tools can provide valuable data analysis capabilities, but they have a barrier to entry that can stop small and midsize businesses from capitalizing on them.